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Abstract 

If  a  matrii  pencil  A-X5  is  known  only  to  within  a  tolerance  t  (because  of  measurement  or 
round-off  errors),  ±en  it  may  be  difficult  to  compute  a  generalized  eigendecoraposition  of 
A  —  \B  since  its  eigenspaces  arc  discontinuous  functions  of  its  entries.  We  are  interested  in 
computing  an  eigendeccmposition  of  A  — X^  which  varies  continuously  and  boundedly  as 
A-kB  varies  inside  a  ball  of  radius  e.  There  arc  two  cases  with  qualitatively  different 
solutions.  The  first  case  i:,  when  A-\B  is  regular,  i.e.  det(A-X^)  is  not  identically  zero.  In 
this  case  we  show  how  tc  partition  the  spectrum  of  A-X^  into  disjoint  pieces  which  remain 
disjoint  and  whose  assoda.rd  eigenspaces  vary  smoothly.  The  second  case  is  when  A-VS  is 
singular  (i.e.  either  det(A->u9)»0  or  A-XB  is  ncnsquare).  This  case  is  more  difficult  than 
±e  first  because  applications  call  for  computing  nongeneric  eigenspaces  which  erist  only 
when  A-\3  lies  in  a  proper  variety  (a  set  of  measure  zero).  The  known  algorithms  for 
computing  these  nongenenc  structures  produce  the  eigendeccmposition  of  a  pencil  close  to 
the  input  where  this  perturbed  pencil  is  guaranteed  to  lie  in  the  proper  variety  mentioned 
above.  In  this  case  we  prove  tiiat  as  long  as  ±e  norm  of  the  perturbations  produced  by  the 
algorithm  are  smaller  than  a  certain  e  we  can  compute  from  the  penal,  the  resulting 
nongeneric  eigenspaces  produced  by  the  algorithm  vary  smoothly.  We  illustrate  this  theorem 
with  an  example  from  systems  theory:  we  derive  perturbation  bounds  for  the  ccatrcllable 
subspace  and  uncontrollable  modes  of  a  system  x  =  Cx^Du. 


.•■     ,  /  -:.■..    1 


'i-Ti''' 


z 


1.  Introdactlon. 

If  we  are  given  a  complex  m  by  n  matrix  pendl  A  —  \B  which  we  only  know  to  within  a 
tolerance  €>0,  what  does  it  mean  to  compute  an  eigendecomposition  of  A-^Bl  By  only 
knowing  A  —  kB  to  a  tolerance  t  we  mean  that  A-XJ  is  indistinguishable  from  any  pencil  in 
the  set 

P(€)-{A  +  £-X(5+/-):l|(£^)||£<€}    . 

An  eigendecomposition  of  A  -  XJ3  will  be  written 

A-\B  =  P(S-\T)Q-^  (1.1) 

where  P  is  an  m  hy  m  nonsingular  matrix,  2  is  an  n  by  n  nonsinguJar  matrix,  and  S  and  T 

are  block  diagonal:  5  =  diag(5;; Si,i,)  and  r=diag(r;.,  .  .  .  .Tj,^).  We  can  group  the 

columns  of  P  into  blocks  corresponding  to  the  blocks  of  S-\T:  P  =  [Pi  I  •  •  •  I  Pt,]  where  P, 
is  m  by  m,,  mi  being  the  number  of  rows  of  5,;- XT,,.  Similarly,  we  can  group  the  columns  of 
Q  into  blocks  corresponding  to  the  blocks  of  S-\T:  Q  =  [Qi\  ■  •  ■  |Ci]  where  2/  is  n  by  n,,  n, 
being  the  number  of  columns  of  5,;- XT;,. 

The  diagonal  blocks  5,, -XT,;  contain  information  about  the  generalized  eigenstructure 
of  the  pencil  A-\B  and  P,  and  Q,  contain  information  about  the  corresponding  generalized 
eigenspaces.  One  canonical  decomposition  of  the  form  (1.1)  we  will  refer  to  is  the  Kronecker 
Canonical  Form  or  KCF  [Gantmacl;er],  where  each  block  5,, -XT;,  must  be  of  one  of  the 
following  forms: 
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Xn-X 


This  is  simply  a  Jordan  block.  Xq  is  called  a  finite  eigenvalue  of  A-X5. 

1   X 


S,-\T,,  = 


This  block  corresponds  to  an  infinite  eigenvalue  of  multiplicity  equal  to  the  dimension  of  the 
block.  The  blocks  of  finite  and  infinite  eigenvalues  together  constitute  the  regular  part  of  the 
pendl. 
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This  kby  k+l  block  is  called  a  singular  block  of  minimal  right  (or  column)  index  k.  It  has  a 
one  dimensional  right  null  space  for  any  X. 


■J//     X/„  —  L, 


This  ;'  +  l  by  ;  block  is  called  a  singular  block  of  minimal  left  (or  row)  index  ;.  It  has  a  one 
dimensional  left  null  space  for  any  X.  The  left  and  right  singular  blocks  together  constitute 
the  singular  part  of  the  pencil. 


We  would  like  to  produce  an  eigendecomposition  which  is  valid  in  some  way  for  all 
pencils  in  P(€),  and  gives  as  much  information  about  all  pencils  in  P(£)  as  possible.  The  kind 
of  information  such  a  decomposition  can  provide  will  depend  on  whether  the  pencil  A- KB  is 
regular  or  singular.  A-XJ9  is  regular  if  A-kB  is  square  and  dct{A-\3)i=0  for  some  X.  This 
is  equivalent  to  A-X_S  having  only  a  regular  part  in  its  KCF.  A-kB  is  singular  if  either 
A-XB  is  square  and  det(A-X5)  =  0  for  all  X,  or  else  A-\B  is  nonsquare.  This  is  equivalent 
to  A-kB  having  a  singular  part  in  its  KCF  [Gantmacher]. 

In  the  regular  case,  A-kB  has  n  generalized  eigenvalues  which  may  be  finite  or 
infinite.  The  diagonal  blocks  of  S-XT  partition  the  spectrum  of  A-kB  as  follows: 


cr-'aiA-kB)  -  U  «^(5/;->^2",/)  "  U  <^( 
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The  subspaces  spanned  by  P,  and  Qi  are  called  left  and  right  deflating  subspaces  of  A-kB 
corresponding  to  the  part  of  the  spectrxim  ct,  [Stewart3,Van  Dooren3].  As  shown  in  [Van 
DoorenS],  a  pair  of  subspaces  P  and  Q  is  deflating  for  A  —  kB  if  P=i4Q  +  flQ  and 
dim(Q)  =  dim(P).  They  are  tJie  generalization  of  invariant  subspaces  for  the  standard 
eigenvalue  problem  A-Xi  to  the  regular  pencil  case:  Q  is  a  (right)  invariant  subspace  of  A  if 
Q=AQ  +  Q,  i.e.  AQCQ. 

To  illustrate  and  motivate  our  approach  to  regular  pencils,  we  indicate  how  we  would 
decompose  P(€)  for  various  values  of  e,  where  A  — X_S  is  given  by 


A-kB  - 


where  -q  is  a  small  number.  It  is  easy  to  see  that  cr={l/Ti,0,T)}.  The  three  deflating  subspaces 
corresponding  to  these  eigenvalues  are  spanned  by  the  three  columns  of  the  3  by  3  identity 
matrix.  For  e  sufficiently  small,  the  spectrum  of  any  pendl  in  ?(£)  will  contain  3  points,  one 
each  inside  disjoint  sets  centered  at  V-q,  0  and  t|.  In  fact,  we  can  draw  3  disjoint  closed 
curves  surrounding  3  disjoint  regions,  one  around  each  X?ct  such  that  each  pendl  in  P(e)  has 
exactly  one  eigenvalue  in  the  region  surrounded  by  each  closed  curve.  Similarly,  the  three 
deflating  subspaces  corresponding  to  each  eigenvalue  remain  close  to  orthogonal.  Thus,  for  € 
sufficiently  small,  we  partition  ct  into  three  sets,  CT;={l/-n},  a2  =  {0}  and  a3={T|}. 

As  c  increases  to  t|/v2,  it  becomes  impossible  to  draw  three  such  curves  because  there 
is  a  pencil  almost  within  distance  t;/ V2  of  A-kB  with  a  double  eigenvalue  at  r\/2: 


'l   0   0 

•n  0  0 

0   0   0 

-X 

0    1   0 

0     0     Tl 

0   0   1 

10       0 

•n  0  0 

0  Ti/2     ? 

-X 

0    1   0 

0     0     Ti/2 

0   0   1 

where  ?  is  an  arbitrarily  small  nonzero  quantity.  Furthermore,  there  are  no  longer  three 
independent  deflating  subspaces,  because  the  ^  causes  the  two  deflating  subspaces  originally 
belonging  to  0  and  t|  to  merge  into  a  single  two-dimensional  deflating  subspace.  We  can, 
however,  draw  two  disjoint  closed  curves,  one  around  1/t)  and  the  other  around  0  and  t|, 
such  that  every  pencil  in  P(t)  has  one  eigenvalue  inside  the  curve  around  1/t|  and  two 
eigenvalues  inside  the  other  curve.    In  this  case  we  partition  cr  =  {0,-n}  [J  {l/r\}  =  a  [J  ctj- 

As  e  increases  to  1,  it  no  longer  becomes  possible  to  draw  two  disjoint  closed  curves  any 
more,  but  merely  one  around  all  three  eigenvalues,  since  it  is  possible  to  find  a  pencil  inside 
P(e)  with  Tl  and  1/t|  having  merged  into  a  single  eigenvalue  near  1,  as  well  as  another  pencil 
inside  P(€)  where  0  and  ti  have  merged  into  a  single  eigenvalue  near  r\/'2.  In  this  case  we 
cannot  partition  a  into  any  smaller  sets. 


This  example  motivates  the  definition  of  a  stable  decomposition  of  a  regular  pencil:  the 
decomposition  in  (1.1)  is  stable  if  the  entries  ot  P,  Q,  S  and  T  are  continuous  and  bounded 
functions  of  the  entries  of  A  and  B  as  A-\B  vanes  inside  ?(«).  In  particular,  we  insist  the 
dimensions  n,   of  the  5,,- XT;;   remain  constant  for  A-XB   in  P(€).   This  corresponds  to 

4 

partitioning  cr=|J  cr,  into  disjoint  pieces  which  remain  disjoint  for  A-X_5   in  P(t).   We 

/-: 
illustrated  this  disjointness  in  the  example  by  surround  each  cr,  by  its  own  disjoint  closed 

curve.  For  numerical  reasons  we  will  also  insist  that  the  matrices  P  and  Q  in  (1.1)  have  their 
condition  numbers  bounded  by  some  (user  specified)  threshold  TOL  for  all  pencils  in  P(£). 
This  is  equivalent  to  insisting  that  the  deflating  subspaces  belonging  to  different  cr,  not 
contain  vectors  pointing  in  nearly  parallel  directions. 

In  the  above  example,  as  t  grows  to  V V2,  there  are  pencils  in  P(e)  where  the  deflating 
subspaces  belonging  to  r\  and  0  become  nearly  parallel: 

0   .  .0 

The  two  right  deflating  subspaces  in  question  are  spanned  by  [0,1,0]^  and  [0,-1/$, I]'", 
respectively,  which  become  nearly  parallel  as  $  approaches  0.  The  numerical  reason  for 
constraining  the  condition  numbers  of  P  and  Q  is  that  they  indicate  approximately  how  much 
accuracy  we  expect  to  lose  in  computing  the  decomposition  (1.1)  [Demmell].  Therefore  ±e 
user  might  wish  to  specify  a  maximum  condition  number  TOL  he  is  willing  to  tolerate  in  a 
stable  decomposition  as  well  as  specifying  the  uncertainty  €  in  his  data. 

With  this  introduction,  we  can  explain  our  first  main  result  which  is  a  criterion  for 
deciding  whether  a  decomposition  (1.1)  is  stable  or  not: 

b 

Theorem  A:  Let  A-\B ,  e  and  TOL  be  given.  Let  a  =  IJ  cr,  be  some  partitioning  of  a  into 
disjoint  sets.  Define  x,  for  l^isb  as 

(2-(pi  +  q}))-  +  2-raax  (p,,q,) 


0  " 

•n  0  0 

T1/2 

-X 

0    1    0 
0   0   1 

min  (Dif„(a;,a-a,)  ,  Dif,(CT_,,a-a;) 
where  p,,  q,,  Dif^  and  Dif,  will  be  explained  below. 
The  corresponding  decomposition  (1.1)  is  stable  if  the  following  rwo  criteria  are  satisfied: 


(1.2) 


max  X,  <  1  n  T> 

and 

2  ■  b  ■    max    (p,  ,  q.)  ■  max  ( V-)  <  TOL    .  (1  4) 

If  we  have  no  constraint  on  the  condition  numbers  (i.e.  TOL  =  ^),  then  we  have  the  foUowmg 
stronger  test  for  stability: 

^-  (Pi  +  <?,) 
:s/s4      Dif^(CT;,a-CT,) 

We  will  explain  the  quantities  p,,  q,,  'Dif^{(T,,cr-a.),  E>if,(a,,a-CT,),  and  Dif^(a,,a-cr,) 
in  the  next  section.  For  now  let  us  say  that  these  bounds  are  fairly  tight,  and  in  fact  the  factor 
b-max  (p!,qj)  in  the  theorem  is  essentially  the  least  possible  value  for  the  maximum  of  k{P) 

and  k{Q)  where  P  and  Q  block  diagonalize  A  -  X5 ,  the  center  of  P(£). 


The  quantities  Dif^,  Dif;,  /?,,  qj,  and  Dif^  may  also  be  straightforwardly  (if  possibly 
expensively)  computed  using  standard  software  packages.  Also,  nearly  best  conditioned  block 
diagonalizing  P  and  Q  in  (1.1)  can  also  be  computed.  Therefore,  it  is  possible  to 
computationally  verify  conditions  (1.3)  to  (1.5)  and  so  to  determine  whether  or  not  a 
decomposition  is  stable  as  defined  above,  as  well  as  to  compute  the  decomposition. 

The  criterion  (1.3)  is  essentially  due  to  Stewart  [Stewartl].  Our  contribution  is  the 
bound  on  k(P)  and  k(Q)  in  (1-4),  as  well  as  the  stronger  bound  in  (1.5). 

The  case  when  A-\B  is  singular  is  more  difficult  than  the  regular  case  because  the 
eigenstructures  we  are  interested  in  computing  are  nongeneric,  i.e.  they  will  be  destroyed  by 
almost  all  perturbations  of  the  pencil  A-\B.  The  structure  which  interests  us  will  exist  only 
when  A  and  3  lie  in  a  proper  variety  [Van  Doorenl,  Waterhouse].  A  proper  variety  is  the 
solution  set  of  a  set  of  polynomial  equations  in  the  entries  of  A  and  B  such  that  the  solution 
set  is  of  positive  codimension  and  hence  of  measure  zero.  What  can  stability  mean  in  this 
context?  For  example,  a  nonsquare  pendl  A  —  XB  will  have  the  same  KCF  for  almost  all  A 
and  B,  and  this  KCF  can  be  determined  by  the  dimensions  of  A  and  B  alone.  Also,  it  is 
possible  to  perturb  a  square  singular  pencil  arbitrarily  little  making  it  regular  with  its 
eigenvalues  anywhere  in  the  extended  complex  plane  [Wilkinson]. 

The  known  algorithms  for  computing  nongeneric  eigenspaces  [Van  Doorenl, 
KSgstrfiml,  KSgstrfim2]  are  stable  in  the  'jsual  sense:  they  return  the  eigenstructure  for  a 
nongeneric  pencil  within  a  small  distance  of  the  input  pencil,  if  one  exists.  Small 
perturbations  in  the  input  pencil  (or  during  the  computation)  lead  to  the  eigenstructure  for  a 
different  nongeneric  pencil,  but,  under  certain  assumptions,  one  with  the  same  kind  of 
eigenstructure  (i.e.  the  same  singular  structure  in  the  KCF)  as  the  other  nongeneric  pencil.  In 
other  words,  the  algorithms  perturb  the  input  pencil  in  such  a  way  that  it  is  guaranteed  to  lie 
on  the  same  proper  v  jiety.  This  leads  us  to  ask  whether  we  can  deduce  perturbation  bounds 
of  the  features  of  the  eigenstructure  which  remain  stable  under  this  kind  of  nongeneric 
perturbation.  In  order  to  do  so,  we  must  determine  what  features  are  stable  and  of  interest. 
We  choose  features  of  interest  in  systems  theory  [Van  Dooren2,  Rosenbrock],  which  we 
describe  below. 

Care  must  be  exercised  in  choosing  stable  features,  since  the  following  example  shows 
that  the  spaces  spanned  by  the  Pi  and  Q,  are  no  longer  all  well  defined  in  the  singular  case. 
Consider 


P(A-\B)Q--  = 


1      0 
0   X-1 


1   0  x' 

—  1 

0    1   X 

= 

0  0   1 

0 

x-1 


As  X  grows  large,  the  space  sparmed  by  2;  (the  last  column  of  Q)  can  become  arbitrarily 
close  to  the  space  sparmed  by  (2;  (the  first  two  columns  of  Q).  Similarly  the  space  spanned 
by  /*;  (the  last  column  of  P)  can  become  arbitrarily  close  to  the  space  spanned  by  P_  (the  first 
column  of  P).  Thus,  we  must  modify  the  notion  of  deflating  subspace  used  in  the  regtilar 
case,  since  these  subspaces  no  longer  all  have  unique  definitions. 

The  correct  concept  to  use  is  reducing  subspace,  as  introduced  in  [Van  Dooren3].  P  and 
Q  are  reducing  subspaces  for  A-X5  if  P=AQ  +  5Q  and  dim(P)  =  dim(Q)-dim(iV,),  where  A^,. 
is  the  right  null  space  oi  A-\B  over  the  field  of  rational  functions  in  X..  It  is  easy  to  express 
dim(A^,.)  in  terms  of  the  KCF  of  A-\B:  it  is  the  number  of  Z.^  blocks  in  the  KCF  [Van 
Dooren3].  In  the  example  above,  N^  is  one  dimensional  and  spanned  by  [1,-X,0]^.  The 
nontrivial  pair  of  reducing  subspaces  are  spanned  by  P   and  Q.  and  are  well  defined. 

We  will  prove  that  if  a  singular  pencil  is  perturbed  in  such  a  way  that  it  has  reducing 
subspaces  of  the  same  dimension  as  the  unperturbed  penal,  then  the  perturbed  subspaces  will 
be  close  to  the  unperturbed  spaces  if  the  perturbation  is  small  enough.  Since  a  pencil  can 


have  several  reducing  subspaces  of  the  same  dimension,  care  must  be  taken  to  distinguish 
these  different  subspaces;  we  show  that  reducing  subspaces  of  the  perturbed  pencil  must 
either  be  close  to  the  unperturbed  spaces  or  a  bounded  distance  away.  This  result  is  stated  in 
Theorem  6. 

We  can  apply  this  result  to  compute  perturbation  bounds  for  standard  algorithms  for 
computing  reducing  subspaces.  For  small  enough  perturbations  of  the  input  data  of  one  of 
these  algorithms,  we  will  show  that  the  algorithm  computes  reducing  subspaces  a  small  angle 
away  from  the  unperturbed  spaces.   This  application  is  incorporated  in  Algorithm  1. 

Reducing  subspaces  are  of  interest  in  systems  theory,  as  the  following  example  shows. 
Consider  the  pencil  A-\B  =  [D|C-X/].  The  pencil  [D\C-\J]  has  the  same  KCF  for  nearly 
all  matrices  C  and  D  of  a  fixed  dimension:  L^  blocks  only.  In  systems  theory  [Wonham]  this 
property  is  called  complete  controllability  of  the  pair  (CJD).  Nonetheless,  systems  theory 
applications  [Wonham,  Van  DoorenZ]  require  knowing  if  (C  J))  is  uncontrollable  ([D\C-\I] 
has  a  regular  part  in  its  KCF)  or  nearly  so.  If  (C  JI>)  is  uncontrollable,  we  may  ask  what  its 
smallest  pair  of  reducing  subspaces  P  and  Q  are.  In  systems  theory  the  P  of  this  pair  is  called 
the  controllable  subspace  C{C ,D)  of  the  pair  (CJ)),  and  is  of  interest  in  designing  control 
systems.  Theorem  6  on  the  stability/  of  reducing  subspaces  will  imply  that  the  controllable 
subspace  of  a  pair  is  computed  stably  by  the  standard  algorithms.  This  result  is  stated  in 
Corollary  4:  if  the  system  (C  J})  is  perturbed  to  (C  +  f^X'  +  fo)  such  that  the  controllable 
subspace  of  perturbed  pencil  C{C +  Ec,D  +  Er))  has  the  same  dimension  as  C(CJ)),  and  if 
II  (£c,£'£))||5  is  small  enough,  then  the  largest  angle  between  C(C,£>)  and  C{C  +  E^J)+Ep)  is 
bounded  by  a  constant  (which  can  be  computed  using  standard  software  from  C  and  D)  times 
II  (£■(;,££,) II  r  (or  else  the  largest  angle  is  bounded  away  from  0).  Similar  comments  apply  to 
the  unobservable  subspace  and  other  features  of  a  control  syst  m  (see  [Van  Dooren2]  for  a 
discussion). 

Also  of  interest  are  the  eigenvalues  of  the  regular  part  of  a  nongeneric  singular  pencil. 
A  generic  perturbation  of  a  square  singular  pencil  will  make  the  pencil  regular  and  can  be 
chosen  to  put  the  eigenvalues  in  arbitrary  locations  in  Jie  extended  complex  plane.  A  generic 
perturbation  of  a  nonsquare  singular  pencil  will  make  the  regular  part  disappear.  As  before, 
however,  the  standard  algorithms  will  produce  a  nongeneric  pencil  with  a  regular  part  of  the 
same  size  as  nearby  pencils.  We  will  show  that  the  spectnim  of  the  regular  part  is  stable,  i.e. 
we  will  derive  perturbation  bounds  on  the  eigenvalues  of  the  regular  part. 

Returning  to  our  systems  theory  example,  the  eigenvalues  of  the  regular  part  of  the 
pencil  [D\C  —  \I]  are  called  the  input  decoupling  zeroes  or  uncontrollable  modes  of  the  pair 
(C,£)),  and  are  of  interest  in  designing  control  systems.  They  are  the  eigenvalues  of  a  system 
i  =  Cx+Du  which  cannot  be  controlled  by  any  choice  of  feedback  u  -  Fx.  Our  result  on  the 
stability  of  the  spectrum  of  the  regular  part  will  show  that  these  eigenvalues  are  computed 
stably  by  the  standard  algorithms.  This  result  is  stated  in  Corollary  5. 

All  the  results  on  the  singular  case  as  well  as  applications  to  systems  theory  are  new. 

The  rest  of  this  paper  is  organized  as  follows.  Section  2  notation  and  basic  lemmas. 
Section  3  will  be  devoted  to  analyzing  the  regular  pencil  case  and  proving  Theorem  A.  This 
work  is  a  generalization  of  work  done  in  the  doctoral  thesis  of  one  of  the  authors  [Denimel2, 
Demmel  and  KSgstrOm].  Section  4  will  be  devoted  to  the  stability  results  for  the  singuJar  case 
with  applications  of  these  results  to  problems  in  systems  theory.  Section  5  contains  numerical 
examples. 


2.  NoUtion  and  Basic  Lemmas. 

||a:||  will  denote  the  Euclidean  norm  of  the  vector  x.  ||A||  will  denote  the  matrix  norm 
induced  by  the  Euclidean  vector  norm.  \\A\\^  will  denote  the  Frobenius  norm,  ct^^  (A)  and 
""nm  (^)  ^'^  denote  the  smallest  and  largest  singular  values  of  the  matrix  A.  If 
A  =  Udiag(cri)V*  is  the  singular  value  decomposition,  then  the  pseudoinverse  of  A,  A*,  is 
given  by  A*  =  Vdiag(CT,^)C^,  where  cj^^^al'-  if  (j,*0  and  c!-,^  =  0  if  ct,  =  0.  k(A)  will  denote 
the  condition  number  cr^3^(A)/crj^.,(A)  of  the  matrix  A;  this  applies  to  nonsquare  A  as  well. 
A®B  will  denote  the  Kronecker  product  of  the  two  matrices  A  and  B  :  A®B  -  [A,y  B].  Let 
coL4  denote  the  column  vector  formed  by  taking  the  columns  of  A  and  stacking  them  atop 
one  another  from  left  to  right.  Thus  if  A  is  m  by  n,  colA  is  mn  by  1  with  its  first  m  entries 
being  column  1  of  A ,  its  second  m  entries  being  column  2  of  A ,  and  so  on. 

The  foOowing  lemma  is  a  generalization  of  a  theorem  in  [Stewartl,  Theorem  3.1] 
which  we  need  for  both  the  regular  and  singular  cases  later.  Let  7  be  an  m  by  n  matrix  of 
full  rank,  and  T*  its  n  by  m  pseudoinverse.  Let  4>  be  a  continuous  map  from  C"  to  C 
satisfying 

i)        II  «1>WII  =s  114.11   •|l:r|P 

ii)       ||4.(x)-<},W||  s    2-il4.|l  maxdlxll  ,||y|l)-||x->i|  ' 

for  some  constant  ||  (t)||  >0.  Let  ^  be  in  C".  ^-,- 

Consider  the  equations 

Tx  =  g-H^)  (2.1) 

and 

X  =  r*(^-<t,(x))  (2.2) 

We  are  interested  in  whether  these  equations  have  a  solution,  and  what  the  solutions  have  to 
do  with  one  another.  There  are  three  cases,  depending  on  whether  m  =  n  (T  is  invertible), 
m<n  ((2.1)  is  underdetermined),  or  m>n  ((2.1)  is  overdetermined).  The  case  m-n  is  dealt 
with  by  Stewart's  original  theorem  and  will  be  used  to  analyze  the  regular  pendl  case.  The 
other  two  cases  are  not  dealt  with  by  Stewart  and  are  used  in  the  singular  case. 

Lemma  1:  Assume  that  ||  T*  || ,  ||  <{)  || ,  and  ||  g  ||  satisfy 

K-  Ijgll  •114.11  •||n|i-<  j  . 

Then  equation  (2.2)  has  a  unique  solution  x  inside  the  ball 

llxll  s   ^  ~  (^-J'^r'  .  \\g\\  .  \\T-\\  <  2  ■  \\g\\  •  117*11    .  (2.3) 

This  solution  i  of  (2.2)  has  the  following  relationship  to  the  solution  x  of  (2.1): 

Case  1:  If  m  =  n,  x=i  is  the  unique  solution  of  (2.1)  (in  the  ball). 

Case  2:  If  m<n,  x=i  is  a  solution  of  (2.1),  but  it  is  not  necessarily  unique. 

Case  3:  If  m>n,  and  if  (2.1)  has  a  solution  x  in  the  ball,  then  x=x.  Thus,  (2.2)  may  have  a 
solution  whereas  (2.1)  may  not. 

Furthermore,  in  cases  1  and  3,  if  (2.1)  has  a  solution  which  does  not  lie  inside  the  ball 
in  (2.3),  it  must  lie  outside  the  ball: 

||i||  ^     ^  +  (1-4k)-  ■ 


I*roof:  The  proof  that  (2.2)  has  a  solution  under  the  given  conditions  is  identical  to  the 
original     proof     of     Stewart's     theorem,     so     we     will     just     outline     it     here.     Let 

<=\\8\\  ■  H\\  •  l|r*||^  <  1/4.   Define  the  iteration 

with  j:g=0.  It  is  easy  to  show  this  is  a  contraction  and  converges  to  a  unique  solution  i 
satisfying 

||;||  ,,    1  -  (1-40--^ 


XK 


l^ll  -IIT-^II  <2|' 


Given  this  solution  for  (2.2)  the  other  results  are  easy.  Case  1  is  the  standard  case  already 
considered  in  [Stewartl].  In  this  case,  T~  =  T'"  so  equations  (2.1)  and  (2.2)  are  equivalent. 
In  case  2,  note  that  T  •  T*  =  I„,  so  multiplying  i  =  T'{g  -  <t)(i))  on  the  left  by  T  yields  the 
result.  Nonuniqueness  is  easy  to  see:  consider  ({>"0.  In  case  3  we  have  7*  ■  J  =  /„,  so  if  i  is 
a  solution  of  (2.1),  multiplying  (2.1)  on  the  left  by  T"  yields  i  =  T*(g-^{x)),  implying 
x=x. 

The  proof  of  (2.4)  is  as  follows.  Given  a  solution  i  of  (2.1)  it  must  satisfy 

^\\Tx\\  =  \\g+Hm  ^  \\g\\  + 114.11  -iiiii^ . 


Solving  the  quadratic  inequality 


l^ll  +  HI 


\\T^\\ 

for  ||i||  yields  two  inequalities  for  ||i|l ,  the  one  in  (2.3)  and  the  one  in  (2.4).   Q.E.D. 
Note  that  this  theorem  works  in  real  vector  spaces  as  well. 

We  need  another  lemma  from  the  literattire  which  we  ate  here.  Let  P  =  [P.\  ■  ■  ■  [P^]  be 
a  square  partitioned  matrix  where  P,  has  n,  columns.  We  want  to  know  how  well  conditioned 
we  can  make  P  subject  to  the  constraint  that  the  columns  of  P,  span  a  given  n, -dimensional 
subspace  P,.  Oearly,  if  P  satisfies  this  constraint  then  any  other  matrix  also  satisfying  the 

constraint  can  be  written  as  PD,  where  D  =  diag(Z3M ■D„„),  D,,  a  nonsingular  n,  by  /», 

matrii. 

L«nuna  2:  [Demmell]  Let  9,  be  the  smallest  angle  between  any  nonzero  vector  in  P,  and  the 
the  subspace  spanned  by  all  the  other  Pj,  j*i.  Then  if  /*  is  any  matrix  such  that  F,  spans  P,, 
we  have 

max  cot  8,/2   ^  inf  k(PD)  ^  b  ■  max  esc   9,     .  cy  s^i 

Another  way  to  express  this  inequality  is  as  follows:  partition  P~-  into  groups  of  rows  as 
follows: 


/.-!  = 


P^ 


where  P'  is  n,  by  n.  Then 


and 


CSC    e,    =   \\PrP'\ 


cot  e,/2  =  WPfP'W  +  (||/'r/''i|--i)^'-  . 

Thus,  the  upper  and  lower  bounds  in  (2.5)  differ  by  a  factor  of  at  most  b.  The  matrices  P,P' 
are  oblique  projections  onto  P,  paraJJel  to  Py,  i=^j.  If  we  choose  D  so  that  the  columns  of 
PPu  are  orthonormaJ,  then  k{PD)  lies  within  the  bounds  of  (2.5). 

If  6  =  2  and  without  loss  of  generality  we  assume 


P  = 


0  /. 


then  the  choice  D;|  =  /„   and  D-2  =  (1  +  ii^ll")   ""^'h  makes  k{KD)  achieve  its  lower  bound 
above,  namely  ■  '    '•- 

k(pd)  =  ii^ii  +  (i  +  WRpy^'^' 

In  terms  of /?,  we  may  write  •   r     ■-.':•" 

Wp^p'W  =  (i+||;?|P)^'^  . 


3.  Rtgnlar  Pendb 

In  trying  to  stably  decompose  A-'kB  into  b  blocks  as  in  (1.1),  we  will  first  study 
decomposing  A-\B  into  two  blocks.  Let  cT  =  aXJa-  be  a  possible  decomposition  of 
aiA-XB)  into  disjoint  subsets.  Define  the  dissociation  of  a;  and  cr-,  written  diss(CTi  ,  ct;),  as 
the  smallest  pertuibation  {A-^E)-\{B  +  F)  of  A-kB  (measured  as  ||  {EJ^)]]^)  that  makes  an 
eigenvalue  X^Sct;  coalesce  with  X;€(T;.  (Think  of  X_  and  \;  as  continuous  function  of 
O^jcsl,  where  \;(j:)  is  an  eigenvalue  of  (A+xE)-\(B +xF)  and  X,(0)  =  X,.  As  x  increases 
from  0  to  1,  X.(jr)  and  X;(ir)  move  continuously  until  possibly  X;(1)  =  X-(1).  Thir.  is  possible 
for  II  (£'vf")||  sdiss(cr;  ,  a-)  but  no  smaller.)  It  is  easy  to  see  that  the  first  condition  of  a  stable 
decomposition  of  P(e),  that  the  number  of  eigenvalues  in  each  a,  remain  constant,  holds  if 
and  only  if  diss(cT;  ,  cr-a,)>€  for  all  i.  Thus,  if  we  can  compute  lower  bounds  on 
diss(cr,  ,  cr-cr,),  a  decomposition  of  P(e)  wiU  satisfy  the  first  criterion  of  stability  if  these 
lower  bounds  are  all  greater  than  e.  This  will  be  our  approach  for  proving  Theorem  A. 

We  begin  with  a  simple  lower  bound  on  diss(cT,  ,  cr-cr,)  based  on  an  extension  of  the 
Bauer-Fike  Theorem  [Bauer  and  Fike]  to  pencils.  It  can  also  be  seen  as  an  extension  of 
Gershgorin's  Theorem  to  pencils  [Stcwart2].  We  include  it  even  though  it  is  generally 
weaker  than  our  later  results  because  it  illustrates  the  nature  of  the  problem. 

Lemma  3:  Suppose  P~'-(A-\B)Q  =  diag(a,-Xp,).    Then  if  s/c  (where  |i|*+|cp=l)  is  an 
eigenvalue  of  the  perturbed  pencil  {A+E)  —  \{B  +  F),  we  have 

\\p--\\  ■  \\Q\\  ■  \\{EJ^)\\s^vam\coL-s^,\ 

Proof:  Assume  without  loss  of  generality  that  s/c  is  not  an  eigenvalue  of  the  unperturbed 
pencil.  Then 

0  =det(c(A+£)-j(5+/')) 

=  det(p--(c(A+£)-j(5  +  F))(2) 

=  det(diag(ca,-j3,)(/  +  diag((ca,-50,)--)/'-^(c£-5F)2)) 

=  de\{I+(^ag{{ca-s^y-)p-'{cE-sF)Q) 
implying 

1  5  ||diag((cot,-53,)-')/»--(c£-5F)C||^ 

^  (min  \ca-s^,\r-  \\p--\\  ■  \\Q\\  ■  ||c£-5£||^ 


s  (min  K-.p,|)-M|P-^||  -llell  -UEJ) 


\E 


which  is  equivalent  to  the  result  claimed.  Q.E.D. 

Stewart  in  [Stewart2]  expresses  this  result  in  terms  of  an  upper  bound  on  the  chordal 
metric 

|X  -  X'l 


x(x,x') 


(l  +  X-)"-(l4-X'-)-=     • 

where  X  is  an  original  eigenvalue  and  X'  is  a  perturbed  eigenvalue.    We  choose  not  to  do  so 
here  because  we  do  not  need  all  the  properties  of  a  metric  for  our  results. 

Just  as  the  Bauer-Fike  theorem  for  the  standard  eigenproblem  showed  that  the 
eigenvalues  of  the  perturbed  matrix  must  lie  in  disks  centered  at  the  eigenvalues  of  the 
unperturbed  matrix,  this  lemma  shows  that  the  eigenvalues  of  the  perturbed  pencil  lie  in 
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certain  regions  around  the  eigenvalues  of  the  unperturbed  pendJ.   By  requiring  these  regions 
to  be  disjoint,  we  get  a  lower  bound  on  diss(ar,  ,  ct;). 

Theorem    1:    Suppose    P~(A-kB)Q=    diag(a^-X3^,a;y-\3;y)    where    ayp^,€cr.    and 
'^"(^^  '  ^^^  "   V2.l\P-\\.l\Q 


min  CT„ 


tj 


3ii  0: 


Proof:  From  Lemma  3  we  see  that  x  can  only  exceed  diss(aj^  ,  a^)  if  there  exist  i,  j ,  c  and  j 
such  that  |c|'+|jp  =  l  all  satisfying 

\\p-^\\  •  ilCll  •  X  s  max  (lca;j-5fJ:,l,|cay-jp:y|) 


1  '^'J   "2/ 

In  other  words,  jrsdiss(c7.  ,  cr-J  only  if 

■1 


X  2 


which  implies  the  result.  Q.E.D. 


P-' 


mm  a. 

ij 


a 


Unfortunately,  ||/'' 


I Q II  may  be  quite  large  and  the  lower  bound  of  this  theorem 


quite  tiny  because  A  —  \B  may  be  hard  to  diagonalize,  even  if  a^  and  cti  are  well  separated.  A 
more  refined  lower  bound  on  diss(a;  ,  ct;)  which  takes  this  into  account  is  derived  as  follows. 

We  begin  by  reducing  the  pencil  A-XJ  to  a  canonical  form  by  a  transformation  which 
does  not  change  distances  or  angles  between  subspaces,  quantities  we  need  to  preserve.  We 
quote  the  following  lemma: 

Lemma  4:  [Stewart3]  Any  regxilar  A-\B  can  be  transformed  into  upper  triangular  form  by 
multiplication  on  the  left  and  right  by  unitary  matrices.  Further,  the  unitary  matrices  may  be 
chosen  so  that  the  eigenvalues  appear  on  the  diagonal  of  the  transformed  pendl  in  any 
desired  order. 

Standard  software  (the  QZ  algorithm  [Moler  and  Stewart]  and  EXCHQZ  [Van 
Dooren4])  is  available  to  compute  this  decomposition. 

Suppose  now  without  loss  of  generality  that  our  original  pencil  is  in  upper  triangular 
form 


A-\B  = 


with  dim(i4,;  — X5;;)  =  n,  and  a(M;,- VS;,)  =  cr,,  with  a.  and  a;  disjoint. 

We  next  need  to  compute  the  deflating  subspaces  of  this  pencil  belonging  to  cr.  and  a-^. 
Equivalently,  we  want  to  find  P  and  Q  such  that 


A  1 1    A  :-i 

[fl.,   fl.,] 

0     A.- 

-\ 

0     B-2 

p-''(A-kB)Q  = 


Ui     0  1 

IB,.     0  1 

—  X 

0         A;: 

0     S;: 

(3.1) 


with  cr(A;,-X5;;)  =  (J,.  It  is  easy  to  see  that  the  pair  of  deflating  subspaces  belonging  to  cr^ 
are  both  spanned  by  /':  =  Q.=  [^„ JO]'".  Without  loss  of  generality  we  seek  ±e  other  deflating 
pair  in  the  form  P^^fL^I/^^]^  and  Q:  =  [R^'\I„y,  which  leads  to  the  equation 
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0     h 


0  A-,--\B-, 


"i 
0    /, 


A.i-X^.j  0 

0  A:-\B. 


or 


A 1  i/v     LtA^n  —       Ap 


'11^ 


n 


(3.2a) 

(3.2b) 

This  is  a  set  of  Inn-,  equatiorL.  in  Inn-,^  unknowns,  the  entries  of  L  and  R.  It  is  not  difficult 
to  see  that  we  can  rewrite  (3.2a)  and  (3.2b)  as  foOows: 


B~  R-LB-^  =  -B. 


col/?' 
colL 

-z. 

colR 
colL 

= 

-colAo' 

-co\B^ 

■ 

'■               ■' 

(3.3) 


One  can  show  Z„  is  nonsingular  if  and  only  if  cr,  and  ct;  are  disjoint  [Stewart3]  as  we  have 
assumed.  Qearly,  If  Z„  is  ill  conditioned,  L  and  R  may  be  very  large,  implying  the  deflating 
subspaces  for  a.  and  cri  are  close  together.  As  we  will  see  later,  this  means  we  can  expect 
diss((Ti  ,  CT;)  to  be  small  as  well.  With  this  motivation,  we  define  as  in  [Stewartl] 

Dif,(Aii^,2;Bii,5::)  -  cr^^(Zj   , 

the  smallest  singxilar  value  of  Z„.  A  trivial  consequence  of  this  definition  is  that 

ULM\e^  ll(Ai:^i:)ll5/Dif-('^i:^2:iflii'^::)  ■ 
It  is  easy  to  see  that  Dif^  is  not  changed  if  we  multiply  A,,  and  5,,  on  the  nght  by  any  unitary 
U,  and  on  the  left  by  any  unitary  V,.  This  implies  that  given  A-kB,  Dii^  is  determined  only 
by  specifying  c7.  =  (t(A,,-\5. J  and  cr-  =  (/i;2-\5;;),  in  that  order.  (We  will  return  to  the 
dependence  on  order  later.)  We  will  therefore  write  Dif„(cr;,CT;)  when  A-kB  Is  clear  from 
context,  or  just  Dif^  if  ct;  and  ct;  are  clear  as  well.  We  record  a  fact  about  I>if„  we  will  need 
later. 
L€mma  5:  [Stewartl] 

Difu('4n+'£ii322+£'::;flu+'fii^::+^2:)  ^  Dif-C-^ii^zi^^.i,/--,)  -  ^2  •  UE, ^^^.Z'^'n J" 22)h 

Other  properties  of  Dif,  can  also  be  found  in  [Stewartl]. 

So  now  we  know  how  to  compute  L  and  R,  and  therefore  blockdiagonalizing  P  and  Q 
for  A  —  kB.  For  our  next  lower  bound  on  diss((7;  ,  cr-J,  we  will  need  to  use  slightly  different 
P  and  Q.  Just  as  Dif^  is  specified  by  ct^  and  cti  alone,  it  is  easy  to  see  that  the  norms  of  L  and 
R  are  determined  only  by  ct.  and  a..  Let  p  denote  (l+||Ll|')^'-  and  q  denote  (1+ ||/?|| -)^'-. 
From  Lemma  2,  we  know  p  and  q  are  lower  bounds  on  the  condition  numbers  of  any 
blockdiagonalizing  P  and  Q,  respectively.  In  the  language  of  Lemma  2,  p  is  the  norm  of  the 
projector  onto  either  left  deflating  subspace  parallel  to  the  other.  Similarly,  q  is  the  norm  of 
a  projector  on  the  right,  p  and  q  are  defined  given  only  ct;  and  a2  since  ||L||  and  \\R\\  are. 
Define  also 


Po  = 


"i 
0    /, 


0     <?-"/„. 


and 
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Co  = 


0    /. 


P     h, 
0 


0 


=  (^(^o)  •  xceo))''  ^  p  +  <? 


(3.4) 


It  is  easy  to  verify  that 

ll^oll  -11  Go" 

The  quantity  \\Pq\\  ■  HCo'^'ll  ^1^  pl^y  ^^  same  role  for  regular  pencils  as  the  condition 
number  of  the  best  conditioned  block  diagonalizing  similarity  plays  for  the  standard 
eigenvalue  problem:  they  measure  the  sensitivity  of  eigenvalues  to  perturbations. 

We  need  one  more  definition  before  presenting  our  next  lower  bound  on  diss(cr;  ,  ctJ: 


Difx(^ii^::;5:i^::) 


inf 


Just  as  with  Dif„  it  turns  out  Dlf^  is  specified  hy  A-\B  and  cr^  and  a;  alone,  permitting  us  to 
write  Dif^((T|,cT;)  when  A-\B  'i%  clear  from  context  and  just  Dif^  when  a,_  and  cti  are  dear  as 
well. 

Dify  will  play  the  same  role  in  this  analysis  as  sep^  [Varah,  DemmeLZ]  played  for  the 
standard  eigenvalue  problem:  it  is  the  size  of  the  smallest  perturbation  that  makes  the  two 
pencils  A i^-XJii  andi422-X-9;2  have  a  common  eigenvalue: 

Lemma  6:  Let  II  (£';i.-£'::,^iii^::)l|£  be  the  size  of  the  smallest  perturbation  such  that 
(Ai;+£;J-X(fl;i+F;i3  aud  (A;2+£::)-X(522+F::)  have  a  common  eigenvalue.  Then 

Dif;,(A,i^::;5,;3::)  =  II  (£::i,^::^ii^^::)l!5  • 

Proof:  Choose  c  and  j  in  the  definition  of  Dif^  to  attain  the  infimum.  Next  choose  E  and  F 
each  of  rank  1  and  such  that  ||£|||-+||f Jl^- =  DifJ  satisfying  (t^J^cA.^-^-sB.^i+E)  =  Q  and 
cTjj^jj(cA22-j5;;  +  /")==0.  These  last  equations  can  be  rewritten 


and 


Now 


a^,(c(A,,+?£)-5(5,.-JE))  -  a^,(c(A,:+£ J-.(5,,  +  f ,J)  =  0 


<r^MAii+^)-^iB.--JF))  -  cT^(c(A::  +  £-2)-5(B:-  +  F::))  =  0 


\\{E,„E,„F,,J^,m=  m\l+  ll'f^lli=Dif:   . 

i.e.  this  particular  choice  of  £■;,  £2;,  £m,  and  £2:  satisfies  the  upper  bound,  and  hence  so 
must  the  smallest  perturbation.  To  show  Dif;^  is  a  lower  bound,  let  £,;  and  £,,  for  »  =  1,2  be 
any  perturbations  such  that  (A, .  +  £,2)-X(fl. .  +  £■•)  and  (A;2  +  £:;)-X(fl22  +  '^::)  ^'^ve  a 
common  eigenvalue  s/c  with  |c|-+|j|"  =  l.  Then 

0=<T^(c(A„  +  £„)-:r(5,,  +  £,,))  =  a^(cA„-j5,;+(c£,,-5£„))  &  cr^,(cA,-55,,)  -  ||c£,-i£,,| 
so  by  the  definition  of  Dif;^  we  get 

Dif,  ^  (||c£;,-.£,,||j  +  ||c£22-^£2:lli)^'-  ^  II  (f::,^::^::,-^::)  II 5   • 
Q  E.D. 

This  lemma  immediately  yields  a  simple  upper  bound  on  diss(a,  ,  G'^: 
Corollary  1: 
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We  are  now  ready  to  prove  our  next  lower  bound  on  diss((T;  ,  a;): 


Theorem  2: 


diSS((T;    ,    C7;) 


Dif>^(cT.,(T.) 

'^2(p  +  q)    ■ 


Proof:  Choose  c  and  s  so  that  |c|-+|j|"  =  l  and  also  so  that  s.'c  is  an  eigenvalue  of  the 
perturbed  pencil  {A  +  E)-\{B  +  F)  but  not  of  -4 - X_3 .  Then 

0  =  det(ciA  +  E)-s(B+F)) 
=  dtt(Po''icA-sB)Qa  +  P^'-(cE  -sF)Qo) 


=  det( 


CA,;-ji5M  0 

0  cA-,-sB.^ 


+PoKcE-sF)Qo) 


'cA,^-sB- 


=  det(/  + 
implying 

1  = 


1-1 


cA-i  —  sB^~ 


P^'{cE-sF)Qo) 


0 


cA,^i—sB,i 

0  cA;-sB 


22 


Po'-{cE-sF)Qoh 


5  max(ll(cA,;-55J--ll,||(cA:,-.5::)--ll)  •  WP^'W  ■  llQoll  •  \\cE-sF\\, 

or,  rearranging  and  using  (3.4) 

min  (II  (cA..-55,/)~-|l  "Ml  (cAy^-sByX'W  "-) 


p  +  q 


^    \cE-sF 


This  in  turn  implies 


\(E,F)\\s^  \\cE-sF\],^ 


rain  (cr^^(cA-.-sB.  J, d^^icA^.^-sB::)) 


p  +  q 

Thus,  the   eigenvalues  s/c  of  the  perturbed  pencil  lie  Ln  clusters  about  the  eigenvalues  of  the 
unperturbed  one,  these  clusters  being  defined  by 

a„.AcA,,  —  sB„) 

\\{E,F)\\,^^^=^ ^ 

p  +  q 

for   1  =  1,2.    These   clusters   can   only   overlap    (a   necessary   condition   for   coalescence   of 
eigenvalues)  Lf  for  some  s  and  c 

p  +  q 

Differ, ,CT;) 

which  implies  our  result.  Q.E.D. 

An  immediate  corollary  of  our  last  theorem  is 
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Corollary  2:  Suppose 


is  block  diagonal.  Then 


A-kB  = 


A.^-X^.;  0 

0  A.--X5.1 


Finally,  we  use  Theorem  2  to  prove  part  of  Theorem  A:  deciding  if  a  decomposition  is 
stable  if  there  is  no  constraint  on  the  condition  number  of  P  eind  Q: 

b 

Corollary  3:  Let  a=  IJ  cr,  be  a  partitioning  of  the  spectrum  oi  A-kB.  Let  p,  and  <7,  be  the 

values  of  (l+|l^|p)^'^  and  (1+ ||Z,||-)--  corresponding  to  a,  and  a-cr,  in  (3.3).  Then  [J  ct, 

/-I 
is  a  stable  decomposition  of  P(t)  in  (1.1)  if  there  is  no  constraint  TOL  on  the  condition 
numbers  of  P  and  Q  and  if  ...,.■. 


max  f.     ^—7,  .  . 

is.'si        Dii^(ct;,ct— cr,) 


<  1 


Proof:  The  last  inequality  implies  that  e  <  diss(cr,  ,  cr-a,)  for  all  Itsi'Sb.  Q.E.D. 

We  can  now  derive  our  third  lower  bound  on  diss(cr;  ,  ct;),  one  which  will  also  lead  to 
bounds  on  k{P)  and  <(Q),  where  P  and  Q  are  the  blockdiagonalizing  equivalence 
transformations  in  (1.1).  We  will  use  essentially  the  same  approach  as  in  [Stewartl].  We 
assume  as  before  that  A-X5  is  upper  triangular  and  that  F  and  Q  blockdiagonalize  i  as  in 
equation  (3.1).  Consider  the  perturbed  pencil  (A  +  E)-\{B  +  F).  Premultiplying  by  P~-  and 
postmultiplying  by  Q  yields  the  pencil 


P-^-  ((A  +  £)-X(5  +  F))  •  Q  = 
We  now  seek  Pcf  and  Q^jr  of  the  forms 


■A..+£,j 


£■2 

A-~  +  E~ 


-X 


(3.5) 


and 


Qef  = 


such  that  premultiplying  (3.5)  by  Pcf   and  postmultiplying  it  by  Q^;r  blockdiagonalizes  it 
Performing  these  multiplications  and  rearranging  the  result  yields 

iA,j  +  E,j-L.E-.J  {I  +  RR-J  (Am  +  £J/?.-L.(A:;+£:2)  +  £o  -  LE2:^: 

I-(A,,  +  £iJ-(A22+£'::)/?:  +  f-  -  L-E^R2  (^2;+^2:+^:^ii)  (^+^2^ J 

(3.6) 
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-\ 


Li(B  11+ F yJ-{B 22+ F.-)R 2  +  F21  -  J-F-R.  (5::  +  ^::-^^:^i:)  O+R-R) 

Setting  the  upper  right  and  lower  left  comers  of  the  pencil  two  zeroes  yields  two  sets  of 
equations: 

{A.^-  +  E._-)R-L-{A2i+E2-^  =  -E~_2  +  lE2Ri  (3.7a) 


(5,1+f  J/?.-L:(52:  +  /'::)  =  -^jl+^/i.^i 


(3.7b) 


and 


(3.8a) 

(3.8b) 

We  wish  to  apply  Lemma  1  to  solve  these  sets  of  nonlinear  equations.  To  solve  (3.7)  we 
make  the  identifications  x  "  [Z.;!/?  J, 


L2(B:i+F-^i)-iB-2  +  F22)R2  =  -'f:i+^:'''i;^2   • 


Tx 


(^,.  +  £j;?.-Z,.(A-,  +  £-;-) 
{B:,+F.JR  -L,{B 22  + F 22) 

-F- 


and 


<}>(^) 


L,£..J?. 
LF.R 


From  Lemma  5,  we  get 

II 7-11  -'=  II T-'-\\  -'  s  Dif„(cr ,,CT;)  -  Vi  •  II  (£::^:2^::^::)|| , 
To  solve  (3.8)  we  make  the  identifications  x  ■  [^:1^:], 

■^:('4u+£:i)-(A:2  +  £::)/?: 
i:(5ii+Fj-(fl::+£::)/?: 

-F-, 


Tx 


and 

\L-E  -R-' 

Using  Kronecker  products  to  express  the  linear  operator  T  in  (3.10a)  yields 

\A,.+E,,y®I„^    -I„®(A22  +  E22)' 
iB,,+F,J^®l.     -/,®(5.:  +  F:,) 


(3.9a) 
(3.9b) 

(3.9c) 

(3.9d) 

(3.10a) 
(3.10b) 

(3.10c) 
(3.11) 


Swapping  the  first  n.n;  columns  with  the  last  n./ii  columns  and  negating  the  whole  matrix, 
none  of  which  changes  its  singtilar  values,  yields 
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which  we  recognize  as  the  matrix  whose  smallest  singular  value  is  defined  as 

Dif„(A2,+£,,^,i+£,,-^,,  +  /-,;3ii+f  J  s  Dif„(A,2^ii;522^ii)  -  ^-  II  (£'ii^22^ii,/'2:)||£ 

The  quantity  D\f^{A22^ n'yB -2J3 ,_J  does  not  generally  equal  D\f^(A,^i^22''^ iiy^ii)  (u^ess  the 
All  and  5„  are  symmetric).  In  the  interests  of  retaining  our  coordinate  fre«  formulation  of  our 
bounds,  we  therefore  define 

Dif,(a,,a;)  -  Dif„(>l:;^,;;fl::^  J 

where  the  fact  that  Dif;  depends  only  on  cr^  and  cr,  follows  just  as  for  Dif^.   This  leads  us  to: 

Theorem  3:   Letp,  q,  Dif^,  and  Dif,  be  defined  as  above.   Then 

rain  (Dif  Ja;,(7;)  ,  Dif.(cT;,CT:)) 

diss(o-i  ,  CT.)  ^ ; : — -:; . 

(2  0"  +  q'))"^  +  2-max  (p,q) 

Suppose  that  ||  (£^)||£-  is  less  than  this  lower  bound  on  diss(a-i  ,  cr-),  and  define 


(£^)IL 


(2(p'f^-))^-  +  2max(p.g) 
nun  (Dif„(cr.,cr;)  ,  Dif;(CT.,cr2)) 


<  1    . 


Further  define  p^p  and  q^^:  to  be  the  norms  of  the  projectors  onto  the  left  and  right  deflating 
subspaces  of  the  perturbed  pencil  {A  +  E)-\{B  +  F).   Then 


^  -)     (1+^)' 
l-x- 


and     q,,  <  2  ■  -^^±4^  ■  q 
l-x* 


Proof:  We  need  to  make  two  different  choices  of  P  and  Q  in  (3.5).  Both  lead  to  the  same 
lower  bound  on  diss(c7|  ,  a;),  but  only  the  first  will  lead  to  a  bound  on  p^r,  and  only  the 
seco-^d  to  a  bound  on  q^f.  The  first  choices  of  P  and  Q  are 


and 


'^  ^ 

p-h.      0 

0     ^": 

. 

0           P--In, 

'"'    ""1 

P"h.         0 

0     '«: 

0     p-'-h^ 

Q  = 


With    this    choice    of    P    and    Q,    it    is    easy    to    verify    that    the    E.j    in    (3.6)    satisfy 


£• 


::ll£ 


P-II^I 


£' 


|f-!i£ 


'?-ll£l 


E> 


£.. 


p-\\E\\^,    and    ||E:-| 


<7-||£||r.     The 


JlFyllf  satisfy  analogous  inequalities. 

Gsnsider  equation  (3.7)  and  the  corresponding  identifications  in  (3.9).    Substituting  in 
these  bounds  on  jlfyllf-  and  ||£,y||£-  yields: 

||r-'||  --  s  Dif,(a„cT.)  -  (2-(p:  +  q')Y'^  ■  |1(£^)|U    , 
Ikll  ^<7-||(£^)||^    , 


and 


14,1!  ^p  •||(£^)||, 
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From  Lemma  1,  we  see  that  as  long  as 

JLiiL_MI< p;-\\(E,F)V 


4     • 


or,  solving  for  IKE,/')!!^, 

(2  (p-  +  <7-))"  +  2  ipq)- 
then  we  can  solve  equations  (3.7)  for  L|  and  R  .  Furthermore,  assuming  ||  (EJ^)\\^  is  less  that 
the  lower  bound  on  diss(c7;  ,  cr-J  in  the  statement  of  the  theorem,  we  see  from  Lemma  1  that 

UL,Ji,J\\s<2-\\8\\  -WT-'-W 

2q\\(Ejr)\\^ 


Dif>i.<T:)-  (2(y'  +  q')yH(EJ')\\E 
x-2-^-Dif„(a,,a-) 


(2(p'  +  q-))--  +  2wax(p.q) 


Dif.(ci:,cT:)  -  {2-(p'  +  q-))^' 


Dif,(a;,a:) 


(2(p-  +  <?-))-  +  2max(p,q) 


-Hi. 


max  {p,q) 


^  X  <  I    . 


Now    consider    equation    (3.8)     and    the    corresponding    identifications    in    (3.10). 
Substituting  in  these  bounds  on  llEyllf-  yields: 

||r--i|--s  Dif,(a;,a:)  -  {2(p-  +  <?-))"  ■  UE,f)\\^   , 


\g\\  ^P-  UErE)\\ 


E    I 


and 


il4.ll   ^q-UE,F)\\,    . 
Using  Lemma  1  as  before,  we  see  that  if 

Dif,((T.,CT;) 


l(E,E)||^< 


(3.13) 


(2(p'-  +  q')y''  +  2(p-qy''-'^    ' 

±en  we  can  solve  equations  (3.8)  for  L;  and  J?-.  We  can  also  bound  |1  (I:,^2)||£-  as  above 
yielding 

max  (p,q) 
This  implies  that  /"^jr  and  Q^^  are  invertible,  since  if 

-1  - 
^^^  ~    L^     h 

then  Pcf  must  equal 
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Pef  = 


QIf'  = 


-^2    ^«. 


R^    L 


0         (i„+L,L.y'- 

which  clearly  exists  since  ||L,||  <  1  for  (  =  1,2.   Similarly, 

\I„  +  R.Ji,)-'  0 

0  iI„+RJi.r- 


is  invertible.  Therefore,  (A+E)-\(B  +  F)  is  block  diagonalizable  by  P  P^jr  and  Q  Qe;:.  This 
means  that  the  spectra  of  the  new  diagonal  blocks  in  (3.6)  must  be  disjoint,  since  otherwise 
we  could  change  E  and  F  infinitesimaUy  and  destroy  the  block  diagonaRzability.  This  is  true 
as  long  as  ||(£,F)i|£  satisfies  the  bounds  in  (3.12)  and  (3.13),  implying  that 

min(Dif.(a.,cr-)  ,  Dif„(ari,CT2)         min(Dif;(CT.,cr:)  ,  Dif„(cr,,a-) 

diss(CTi  ,  CTt)  s  '■ — - — ^^ ^—  a — z. ^ — ' — 

'    -  '     -'  (2  (p-  +  q''))-  +  2  ip-q)-  (2  (p-  +  q'))-  +  2  m20i{p ,q) 

This  proves  the  first  part  of  the  theorem. 

To  compute  a  bound  on  p^^r  we  proceed  as  follows.  First  note  that 


-X     I 


0 


so  that  if  ||Xi|£<l  and  ||  >'||£<1  we  can  estimate 


k( 


X    ' 


(l  +  max(||XH£,  \\Y\\,))- 
l-\\X\\,-\\Y\\r 


(3.14) 


Since 


max{\\L,\\s,\\L:\\s,\\R.\\E,\\R:\\r)^x<  1   , 


we  can  apply  (3.14)  and  Lemma  2  we  get 


PEjr  ^    KiPP^jr)   5    K{P)-K{Psf)    ^    2   ■  p 


as  claimed. 


To  get  a  bound  on  q^^  we  repeat  this  entire  argument  using  the  following  choices  for  P 


and  Q: 


P  = 


and 


Q  = 


0    /, 


0     /, 


0        q--I„ 


r-L 


0 


0 


Q.E.D. 

We  record  another  result  of  this  approach  here,  because  we  will  need  it  in  the  next 
section.  In  it  we  use  a  measure  of  the  distance  between  two  subspaces  P;  and  P;  of  equal 
dimension,  the  maximum  angle  9n„(Pi.P;),  which  we  define  in  terms  of  the  (acute)  angle 
between  nonzero  vectors  x  and  y  (9(x ,>-)): 
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«a^x(P:.P:)  -  ni.^  e(^:>J^:) 


*,i«0 

Lemma  7:  Let  ||  (£^)||£  satisfy  the  constraint  in  iht  statement  of  Theorem  3.  Let  P  be  the 
left  deflating  subspace  o(  A-XB  belonging  to  j;,  and  let  Pfjr  be  the  corresponding  left 
deflating  subspace  of  (A +  £)- X (5  + F).   Then 

en.«(P.P£/^)  ^  arctan(xO'  +  ^'^P^))    ■ 

If  x<  1/2  we  can  improve  this  bound  to 

e„„(P,P£jc)s  arctan(2-j:/p)    . 

Similarly,  if  Q  and  Q^r  are  right  deflating  subspaces  oi  A-\3  and  {A  +  E)-\{B  +  F), 
respectively,  corresponding  to  a-,  then 

9n«(Q.Q£f)  ^  arctan(j:(<7  +  ^^-l))    . 
K  j:<1/2  we  can  improve  this  bound  to 

Ona.CQ.Qr-O  ^  arctaii(2  ■  x  /  q)   . 

Proof:  We  do  only  the  case  for  the  left  deflating  subspaces;  the  right  one  is  analogous.  The 
first  n^  columns  Ln 


P  = 


0    /. 


P'-'h, 


0 


0     p--i„. 


[(/+L.L:)--] 


span  P,  or  equivalently  [I„  \0Y  spans  P,  and  the  first  n^  columns  of  PP^f  {P^f  as  in  the  proof 
of  Theorem  3),  or  equivalently 

-p-^'-L, 

spans  Pr^.  Postmultiplying  this  set  of  columns  by  {I+LiL2)-(j>''^-I„  -  p'^'^-L-Lj)''  does  not 
change  their  span,  and  yields 

It  is  easy  to  show  that  the  maximum  angle  between  two  spaces  spanned  by  [/|0]^  and  [I\ZY  is 
arctan(||Z||)  [Stewartl],  so 

9^(P,P£^)  s  arctanCll  -p-'-L.Qn.^  "  p-'-LL,)-'^) 
^  aictan(p-'^-x/ il-p-'-ip--!)^"')) 

^aicxan(x{p  +  (p^ - 1)'"'))   . 

Ux<y2,    1  -  II p"--Z,-Z,; II £>  1/2  and  the  result  follows.    Q.E.D. 

To   compare   the   strength  of  the  results  in  Theorems  2   and  3   we   need  a  lemma 
comparing  Dif^,  Dif,  and  Dif;^: 
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Lemma  8: 


max  (Dif„((T.,(T2),Dif,(a;,cT;)  ^  Difx(c7,,CT;)    . 

Proof:  We  prove  the  result  for  Dif„.  Tlie  result  for  Dif,  follows  from  the  symmetry  between 
Dif„  and  Dif,. 


inf 

II  "11 -Ml -1 


A—CV-B--SV 


inf 

kp+iJP-1 

IWl-iivil-l 


'  cvu*  A  y^  —  svu*  3 II 
A'-cvu*  —B--:Svu* 


=       inf 

Icp+^p-1 

iin-i. 


cPAy-sPBy^ 
A-,cP-B-sP 


inf 


\LA,-RB,, 
A-.L-B-R 


=  cr. 


"i      --    "j 

-L®B..    -L®A~. 


=  cr. 


Ay®I„     -I„®Al2 


=  o-„ 


(since     (X®Yy=X^<®Y^    and     since     there     exists     a    permutation    matrix    P     such    that 
P^iX®Y)P  =  Y®X) 


=  a 


n;n 


"/„®A,,    -aL®/^ 
L^B.,    -B^-^L 


=  Dif,(A,,^::;5;.3::) 


Q.E.D. 

Applying  this  lemma  and  a  little  manipulation  to  the  lower  bounds  on  diss(a;  ,  a;)  in 
Theorems  2  and  3  shows  that  the  lower  bound  in  Theorem  2  is  always  stronger  than  the 
lower  bound  in  Theorem  3: 

Theorem  4: 

Dif  jCT^.a;)  min  (Dif  jCT.,cr;)  ,  Dif-(cT.,cT:)) 

^■(p  +  q)  ^   {lip-  +  <?-))"  +  2max(p,^) 

To  see  how  much  larger  the  one  lower  bound  may  be  than  the  other,  consider  the 
example 
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A-\B  = 


e   1 


-t    1 


-X  • 

7  o' 

0   I 

= 

0    A22 

-  X 

0        fl;; 

•    1 

-e 

with  both  A .-Xfl.,  and  A-z-'KBzz  being  of  dimension  «.  Then  for  small  t  a  little 
computation  shows  that  the  lower  bound  of  Theorem  2  is  proportional  to  e"  and  the  lower 
bound  of  Theorem  3  is  proportional  to  t*""',  almost  the  square. 

Even  though  Theorem  3  provides  a  worse  lower  bound  on  diss(ori  ,  ct-),  the  analysis 
leading  up  to  it  allows  us  to  bound  the  cxindition  numbers  of  the  P  and  Q  matrices  in  (1.1), 
which  the  analysis  of  Theorem  2  does  not  allow. 

n 

Theorem  5:  Let  cr  =  |J  ct;  be  a  disjoint  parti ti oiling.  Let  p,  and  <7,  denote  the  norms  of  the 

;-; 
projectors  onto  the  left  and  right  deflating  subspaces,  respectively,  belonging  to  cr,.    For 

lsi<i>  suppose 

(2(pi  +  qf)y'-  +  2  ■  max  (p.,q^) 


\(E,F) 


<  1 


''       »^"'"^"^     min  (Dif„(c7.,a:)  ,  Dif,(a.,a:)) 
and  let  x  -  max  x,.  Let  P  and  Q  denote  matrices  such  that  P~'-{(A+E)-\(B+F))Q  is  block 
diagonal  as  in  (1.1).   Then  P  and  Q  can  be  chosen  so  that 


k{P)  s  2b 


l-x- 


max  p, 


and 


k(Q)  rs  2b  ■  li±4-  •  max  ^, 
1-x^       :sis4 

Proof:  Let  p.(EF)  and  q,(EF)  denote  norms  of  projectors  onto  left  and  right  deflating 
subspaces  belonging  to  ct,,  respectively,  of  {A  +  E)-\(B  +  F).  By  Lemma  2  and  Theorem  3 
we  have 

k(P)  rs  b  ■  max  p,{EF) 

1-s.i-s.b     ' 

:s  2b  ■  max •  p. 


2b 


1-x^ 


max  pi 


An  analogous  sequence  of  inequalities  show  that 


k{Q)  ^  2b  ■ 


1-x^ 


max  q. 


Q.E.D. 
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Requiring  that  these  bounds  on  k(P)  and  k(Q)  be  less  than  TOL  yields  the  condition  of 
Theorem  A  that  the  decomposition  (1.1)  be  stable.  This  completes  the  proof  of  Theorem  A. 

Note  that  by   Lemma  2  b-max  pi   is  essentially   the  condition  number  of  the  best 

conditioned  P  and  b-max  q,  the  condition  of  the  best  conditioned  Q  that  block  diagonalize 

A  —  \B.  Therefore  the  extra  factor  2(l+x)V(l-x^)  in  Theorem  3  indicates  how  much  the 
condition  numbers  ot  P  and  Q  can  grow  beyond  the  minimum  possible. 

We  note  that  if  we  specialize  to  the  standard  eigenproblem  {B  =  l  and  i|Al|:sl), 
Theorem  A  yields  essentially  the  same  results  as  derived  in  [Demmel2]  for  the  standard 
eigenproblem  [Demmel  and  Kalgstrtira]. 
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4.  Singular  Pencils. 

Just  as  our  analysis  of  reg^dar  pencils  began  with  a  triangular  canonical  form  (Lemma 
4),  we  also  begin  with  such  a  canonical  form  for  singular  pencils: 

L«mma  9:  [Van  Dooren3]  given  any  pencil  A-XB  there  exist  unitary  matrices  P  and  Q  such 
that  P~-{A-\B)Q  is  in  the  following  quasi-uppertriangtilar  form: 


p-'-iA-\B)Q  = 


^r.;-'>^r.i 


A,-XJ9, 


(4.1) 


where  A^—\B^  has  only  L,  blocks  in  its  KCF  (i.e.  all  right  minimal  indices,  hence  the 
subscript  r),  A,-\B,  has  only  Lj  blocks  in  its  KCF  (i.e.  all  left  minimal  indices),  A^^^-\B,,^ 
is  upper  triangtilar  and  reguJar,  and  each  *  is  an  arbitrary  conforming  pencil.  A  -XJ^,,  can 
be  chosen  with  its  spectrum  on  the  diagonal  in  any  order.  Further,  the  blocks  in  the  KCF  of 
A-\B  are  precisely  those  which  appear  in  the  KCFs  of  A^-X^ri  '^;~^.'.  ^^'^ '^^^^"^r^y- 
Suppose  now  without  loss  of  generality  that  our  pencil  is  in  the  form  of  (4.1): 


A-\B  = 


A,,   A-^' 

•5.. 

5-' 

0       A;; 

X  - 

0 

5:: 

(4.2) 


X5r,y,  and  A-,-,-X5-, 


where  A.^-XB^^  includes  all  of  A^-\B^  and  possibly  part  of  A^^^ 

contains  all  of  A.-\B,  and  the  remainder  of  A^,j-XJ^„.  In  particular,  we  assume  A^^-XB-^ 
and  A;2-X^::  contain  disjoint  parts  (a,  and  a-.^  respectively)  of  the  spectrtim  a  of  A-XJ.  If 
we  denote  the  numbers  of  rows  and  columns  of  A;,-Xfl,  by  m,  and  n,,  then  it  is  easy  to  see 
m^^n;  (with  equality  if  and  only  if  A^-\B^  is  null)  and  m^Sni  (with  equality  if  and  only  if 
A;-X5;  is  null).  We  want  to  blockdiagonalize  this  pencil,  the  upper  left  block  containing  a^ 
and  the  lower  right  block  CT;.  Equivalently,  just  as  for  the  regiilar  case,  we  seek  P  and  Q 
such  that 

p-'-{A-\B)Q  = 

with  A;;-X5;,  having  the  same  KCF  as  A;,-X5;,.  The  pair  of  reducing  subspaces  belonging  to 
(T.  are  spanned  by  Q_=[I„  \0Y  and  /*.=  [/„  |0]^  [Van  Dooren3].  We  seek  to  block  diagonalize 
this  pencil  by  choosing  Pi  =  [L^\J„  Y  ^^^  Qi~[^^^n  Y >  which  leads  to  the  equation 


A..     0  ■ 

5::     0  ■ 

0     A;: 

-  X 

0     5;: 

0     /„ 


Am-XJ..   Aj^-XB-^' 
0  A-,-X5.. 


'^^ 

R 

A. 

-XB  ■ 

0 

0 

^": 

— 

0 

^•.l 

-XB 

, 

or 


A,.R-LA.-^  =  -A, 


B   R-LB-,  =  -flo 


(4.3a) 
(4.3b) 


which  are  identical  to  (3.2a)  and  (3.2b).   Thus,  we  can  rewrite  (4.3ab)  just  as  in  the  reguJar 
case  as 


/„®A,.    -AT.®I 


coLR 

coU?' 

-colA^' 

colL 

-Z.  • 

colZ. 

- 

-coLflj 

(4.4) 


/„®5;.       -Bl-®I, 

This  is  a  set  of  Im-n^  linear  equations  in  nn-^  +  mm-_  unknowns,  the  entries  of  L  and  R.  Since 
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m,^n.  and  mi^/i;,  we  see  we  have  at  least  as  many  unknowns  as  equations  with  equality  if 
and  only  ii  A-\B  is  regular,  the  case  analyzed  in  part  3  of  this  paper.  When  it  is  singuJar,  it 
has  a  (nonunique)  solution  as  stated  in  the  following  lemma: 

Lemma  10:  [Van  Dooren3,  lemma  2.3]  (4.4)  is  a  consistent  set  of  equations  for  arbitrary  Ao 
and  Bj2,ii' 

1)  Ai^-XBi  has  no  left  minimal  indices  in  its  KCF, 

2)  A22~^B-2  has  no  right  minimal  indices  in  its  KCF,  and 

3)  (T^=a{A,,  —  \B,j)  and  cr2  =  a(A;;-X_fl;2)  ^^  disjoint. 

Thus,  Z„  is  of  full  rank  leading  us  to  define  E)if„  just  as  for  the  regular  case: 

Dif-(^u^:2^u^::)  -  <^nr..(2J 
with  the  same  trivial  consequence: 

Also  as  before,  Dif„  is  specified  merely  by  choosing  a^  and  CT;  =  a  — cT;,  permitting  us  to  write 
Dif^(cT|,CT;)  if  A-X5  is  known  from  context  or  even  just  Dif„  if  cr^  is  known  as  well. 

Given  a  space  of  solutions  of  (3.3),  we  will  choose  the  (LJi)  of  least  norm  since  it  leads 
to  P  and  Q  as  well  conditioned  as  possible.  Call  this  minimum  norm  solution  (Iq-^o)  ^^'^ 
denote  (1+ HLoII")"*  ^V  P  ^^  (l+ll^oll')^"^  ^y  9-  ^^^  ^  ^-^k  '^  specified  only  by  cr^,  so  are 
IlLoll,  \\Ro\\,p  and  9. 

We  follow  essentially  the  same  approach  as  for  Theorem  3  in  section  3:  Choose  P  and 
Q  (using  Lq  and  Rq)  to  satisfy  equation  (3.1),  leading  to  equation  (3.5).  Seek  f£/  and  Q^jc  of 
the  same  form  as  in  section  3  such  that  premultiplying  (3.5)  by  Pcf  and  postmultiplying  it  by 
Q^f  blockdiagonalizes  it.  As  before,  this  leads  to  equations  (3.7ab)  and  (3. Sab)  to  solve. 
Again,  we  wish  to  apply  Lemma  1.  This  time,  however,  the  linear  operators  T  in  (3.9a)  and 
(3.10a)  are  no  longer  square.  To  use  Lemma  1,  we  need  to  show  both  operators  have  full 
rank.  The  linear  operator  T  in  (3.9a)  is  represented  by  Z^  in  (4.4)  above,  which  Lemma  10 
showed  to  be  full  rank.    The  operator  T  of  (3.10a)  is  represented  using  Kronecker  products 


as 


The  next  lemma  shows  that  this  matrix  is  of  full  rank 


(4.5) 


Lemma  11:  Assume  S^-XT.  has  only  L^  blocks  and  regxilar  blocks  in  its  KCF,  that  5;-Xr2 
has  only  Lj  and  regular  blocks  in  its  KCF,  and  that  u,_  and  ct;  are  disjoint,  where  cr,  is  the 
spectnim  of  5,- XT,.   Then  the  system  of  linear  equations 


(4.6) 


'LS,-S-R 
LT-T-R 

has  at  most  one  solution.    Suppose  S,-\T,  is  m,  by  n,.  Then  this  unicity  of  solution  is 
equivalent  to  the  matrix 


Tl®I„^   -I„®T2 


(4.7) 


having  full  rank. 
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Proof:  The  assumptions  on  the  KCFs  of  S,-\Ti  imply  that  m^sn.  and  m-^n-.  Thus,  the 
matrix  in  (4.7)  has  at  least  as  many  rows  as  columns,  so  if  we  show  (4.6)  has  at  most  one 
solution,  this  will  imply  (4.7)  has  full  rank.  Qioose  P,  and  Q,  so  that  Pr'(Si~^T,)Q,  is  in 
KCF.  Call  the  blocks  on  the  diagonal  of  the  KCF  S,j-\T,j.  Then  (4.6)  decomposes  into  set  of 
independent  equations 

C, 
D, 


'Ji^ 


(4.8) 


If  we  show  (4.8)  has  at  most  one  solution  for  each  j  and  k,  we  will  be  done.  There  are 
several  cases.  First  suppose  Sj-\Tj-L,-  and  Sii-\T-^  =  Ll .  Then  it  is  easy  to  see  from  the 
forms  of  Lj-  and  L[.  that  (4.8)  is  essentially  triangitlar  in  that  we  first  solve  (4.8)  for  the  first 
column  of  ^',  then  the  first  column  of  I',  then  the  second  column  of  ^',  the  second  column 
of  L',  and  so  on.  Thus,  if  (4.8)  has  a  solution,  it  is  uniquely  determined.  The  second  case  is 
when  Sj-\T,j  is  a  Jordan  block  and  S-^^-XT^i^Ll.  In  this  case  we  may  solve  (4.8) 
successively  for  the  last  row  of  L',  the  last  row  of  /?',  the  next  to  last  row  of  L'  and  so  on. 
When  Sj-\Tj  is  a  block  with  an  infinite  eigenvalue,  we  return  to  the  column  by  column 
regime.  The  other  cases  are  similar,  except  when  both  Sj-\T.^  and  52i-xr;^  are  regular. 
Since  by  assumption  they  have  disjoint  spectra,  this  reduces  to  the  case  covered  in  section  3. 
This  proves  that  the  matrix  in  (4.7)  is  of  full  rank  under  the  conditions  stated  in  the  lemma. 
Q.E.D. 

This  lemma  justifies  the  definition 


Dil.{A..^..-^-.^yJ  -  CT, 


As  before  we  can  show  that  this  definition  is  really  coordinate  free  allowing  us  to  write 
Dif ,(cr. ,CT;)  when  A-\B  is  known  from  context  or  just  E>if,  if  cr^  is  known  as  well.  This  leads 
us  to  the  following  extension  of  Lemma  7: 

Theorem  6:  Assume  A -X5  is  in  the  form  (4.2).  Let  P  and  Q  be  the  left  and  right  reducing 
subspaces  of  A-\B  belonging  to  a-.  Then  if  {A-^E)-\{B  +  F)  has  reducing  subspaces  P^ 
and  Q^jr  of  the  same  dimensions  as  P  and  Q,  respectively,  where 

min  (Dif„(a.,a;)  ,  Dif.(a.,a.)) 


\(E,F)\\s  =  x 


where  x<l 


(2-(p-  +  q-))-  +  2-max  (p ,q) 
then  one  of  the  following  two  cases  must  hold: 
Case  1: 

9mxx(P.P£/-)  ^  3rctanix(p  +  (p'-iy')) 
and 

9a«(Q.Q£F)  ^  arctan(;c(<7  +  (<7— 1)^'-))  • 
If  j<l/2  then  we  liave  the  tighter  bounds 

9:n^,(P.P£5)  ^  arctan(2x  /  p) 
and 

e^„(Q,Q£c)  5  arctan(2x  /  <?)    . 

In  other  words,  both  angles  are  small,  bounded  above  by  a  multiple  of  the  norm  of  the 
perturbation  \\{EJ^)\\^. 
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Case  2:  Either 


9in«(P'Pff)  ^  ar«an( 


1 


V, 


m  ■  p  +  ip — 1)-- 


-) 


or 


On^TCQ.Qcj-)  ^  arctanC-^^ 


1 


-) 


■n  •<?  +  (<7— 1)" 

In  other  words,   at  least  one  of  the  angles  between  perturbed  and  unperturbed  reducing 
subspaces  is  bounded  away  from  0. 

Proof:  li  A  —  \B  were  regtilar,  the  proof  of  case  1  would  be  Lemma  7.    Since  A -X5  is  of  the 
form  (4.2),  P  and  Q  are  spanned  by  the  columns  of  [/„J0]^  and  [/„.|0]^,  respectively.  In  the 

course  of  proving  Lemma  7  we  assumed  that  V^f  was  spzinned  by  the  columns  of 


-p'-Lj 


(4.10) 


First  we  will  show  that  if  this  assumption  is  false,  case  2  holds.  If  the  space  Pf^r  we  wish  to 
span  does  not  contain  a  vector  orthogonal  to  P  (in  which  case  case  2  trivially  holds),  then  it  is 
easy  to  see  that  P^r  can  be  spanned  by  the  columjis  of  [/[XT'  for  a  suitable  X.  There  are  two 
cases:  I— XL  is  singular  and  I— XL  is  nonsingxilar.  If  I— XL  is  singular  then  some  vector  in 
P£jr  is  also  in  the  space  spanned  by  the  columns  of  [l'"!/]^.  But  if  there  were  such  a  vector, 
then  the  maximum  angle  between  P  and  P£_r  would  be  at  least  arctan((;7*-l)~-'*),  which 
implies  that  Case  2  holds.  K  I-XL  is  nonsingular  we  claim  that  we  -an  choose  I,  to  make 
the  columns  of  (4.10)  have  the  same  span  as  [/[X^^:  choose 

Ij  =  -p~''{l-XL)-'-X 

which,  when  substituted  into  (4.10)  yields 

'l+L{I-XL)-'-X 
(/-XL)-'X 
The  top  block  of  this  last  matrix  is  nonsingular,  since 

/  0 


-X 

I 


I-XL  -X 

0       I  +  L{l-XL)-'-X 


L{I-XL)-^   I 
is  nonsingular.  Postmultiplying  by  the  inverse  of  this  block  yields  [/^T". 

Thus  it  suffices  to  consider  potential  new  left  reducing  subspaces  spanned  by  columns  of 
the  form  (4.10)  (and  new  right  reducing  subspaces  spanned  by  analogous  sets  of  vectors). 
This  leads  to  equations  (3. Sab)  in  the  same  way  as  in  the  proof  of  Theorem  3.  The  proofs  of 
Theorem  3  and  Lemma  7  apply  here  almost  verbatim.  The  essential  difference  is  that 
equations  (3.7ab)  and  (3. Sab)  are  no  longer  square.  Since  equations  (3.7ab)  are 
under  determined,  we  must  use  case  2  of  Lemma  1  to  estimate  when  they  are  soluble.  From 
Lemma  1  we  can  get  a  solution  of  (3.7ab),  but  we  lose  uniqueness.  Since  equations  (3. Sab) 
are  overdetermined,  we  must  use  case  3  of  Lemma  1  to  estimate  the  solution  when  one 
exists;  that  one  does  exist  is  the  assumption  that  reducing  subspaces  of  the  appropriate 
dimensions  exist.  The  two  cases  correspond  to  the  two  cases  in  (2.3)  and  (2.4)  of  Lemma  1. 
Case  1  follows  just  as  in  Lemma  7,  and  corresponds  to  the  bound  on  the  solution  in  (2.3). 

Case  2  corresponds  to  (2.4),  which  says  that  the  solution  of  (3. Sab)  satisfies 
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II4>II  •  llr^ll 


Dif,(or.,a-)  -  (2-(p-^g-)y''-^-UE,F)\\ 


E 


2-q-UE^)h 

Dif;(cT.,CT;) 


Dif,(c7;,c7:)  -  (,2-(j,-^q-))- 


(2(p-  +  (j-)y-  +  2max(p,q) 


X  ■  Dif,(c7,,a:) 
2<7- 


{2{p'  +  q')y-'-  +  2maxO,<7) 


TTius,  either  \\L^\\e  a  2--^/x  or  \\R-}\^  ^  Z'-'^/x.  If  111:115  2:  I'^-'^/x,  then  since  Pc^  is 
spanned  by  the  columns  of  (4.10),  one  can  see  the  lower  bound  on  IlL-lIs  translates  into  the 
following  lower  bound  on  the  tangent  of  the  largest  angle  between  P^jr  and  P: 

tan(9_(P,P,,))  ^   ^^^"'','^:''|,^  „        ^-     '    '   '■ 

1  +  P     Ml  ^11  •11^211        :■■-.■     .   .-■■•     ■ 

=  (¥^^  (p--i)")--s(v^-p  +  (p'-ir^^)-' 

as  desired.  If  ||^;|l£-  s  2""  /  x,  the  proof  is  analogous.   Q.E.D. 

One  way  to  interpret  this  lemma  is  as  a  nongeneric  perturbation  bound:  if  E  and  F  are 
small  enough,  then  for  each  pair  of  left  and  right  reducing  subspaces  of  (A-t-£)-X(5  +  F)  of 
the  same  dimensions  as  the  unperturbed  pair,  either  both  the  left  and  right  subspaces  of 
(A  +  E)-\(B  +  F)  will  be  a  small  angle  (bounded  by  a  multiple  of  the  norm  ||(£,/")||£)  away 
from  the  unpertiirbed  spaces,  or  else  at  least  one  of  them  will  be  bounded  away  in  angle 
from  the  unperturbed  space. 

Theorem  6  also  supplies  perturbation  bounds  for  algorithms  used  to  compute  the  KCF. 
These  algorithms  compute  decompositions  of  the  form  (4.1)  or  an  equivalent  form.  The 
algorithms  of  [Van  Doorenl],  [KagstrOml],  and  [KagstrOniZ]  among  others  are  all  stable  in 
that  they  can  produce  an  exactly  singular  pencil  (along  with  its  KCF)  near  the  pencil  A  —  \3 
supplied  as  input.  The  user  may  choose  how  near  this  exactly  singular  pencil  must  be  to 
A-\£  hy  varying  certain  thresholds  in  the  algorithms.  Of  course  if  the  pencil  is  square  there 
may  be  no  singular  pencil  within  the  distance  chosen  by  the  user,  which  the  algorithm  will 
then  report  (the  algorithm  may  unfortunately  fail  to  find  such  a  matrix  even  if  one  exists). 
The  algorithms  are  also  stable  in  the  sense  that  for  suitably  chosen  thresholds  and  most  input 
problems  they  will  retiuTi  pencils  with  the  same  singular  structures  in  their  KCFs  for  all  input 
pencils  sufficiently  close  to  A-X5.  These  properties  are  discussed  at  length  in  the  references 
at  the  begiiming  of  the  paragraph. 

Therefore,  one  can  take  the  estimate  of  Theorem  6  and  apply  it  to  analyzing  the  error 
of  standard  algorithms  in  the  following  algorithm: 

Algorithm  1: 

1)  Reduce  a  pendl  A-\B  to  A' -\3'  of  the  form  (4. 1)  using  a  standard  algorithm.   Let 
5  be  a  bound  for  the  perturbation  the  algorithm  makes  in  the  pencil  in  order  to  reduce  it 
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(S  is  computed  by  the  algorithm).  Let  P'  and  Q'  be  left  and  right  reducing  subspaces  of 
A'-\B'  corresponding  to  ct;. 

2)  Compute  the  bound 

_   min  (Dif Jg^,g;)  ,  Dif,(cr,,CT-)) 
(2(p-  +  q'-))'-  +  2max  {p  ,q) 
in  the  statement  of  Theorem  6. 

3)  If  S  <  A,  then  suppose  A"-XB"  is  a  singtilar  pencil  within  distance  ,v  <  A-8  of 
A-\B  with  right  and  left  reducing  subspaces  P"  and  Q"  of  the  same  dimensions  as  P' 
and  Q',  respectively.  Then  one  of  the  two  cases  holds: 

Case  1:  "'"   " 

e^(P',P")  ^  arctanC-^  •  (p  +  (p'-iy'')) 


and 


9^„(Q',Q")  ^  arctan(^  •  (<?  +  (<?-- 1)"))    , 


with  better  bounds  if  A"-\B"  is  within  (A -  S)/2  of  A  -  XJ . 
Case  2:  Either 


0^^,(P',P")  a  arctan(-T \  .     ^^.,J 

V  n 


J_ 

'm  ■  p  +  (p'-iy- 


or 


9,„(Q',Q")^arctan(-^ ^\  ■     ,,-)    • 

yn  ■  q  +  (q'-l)-' 

4)  if  5  >  A,  no  perturbation  bound  can  be  made,  because  the  perturbation  5  made  by 
the  algorithm  is  outside  the  range  A  of  our  estimate. 

Now  let  us  interpret  these  results  for  the  problem  of  computing  controllability 
subspaces.  Let  C(C,D)  denote  the  controllability  subspace  of  the  pair  of  matrices  (CJD).  As 
discussed  in  [Van  Dooren2],  this  subspace  is  simply  the  left  reducing  subspace  corresponding 
to  (7^  =  0  of  the  pendl  A-\B  —  [DJC-Xi].  It  is  easy  to  see  that  this  pencil  can  have  neither 
infinite  eigenvalues  nor  ij"  blocks  in  its  KCF  since  fl  =  [0|/]  has  full  rank;  hence  it  can  only 
have  finite  eigenvalues  and  Lj  blocks  in  its  KCF.  Also,  one  can  see  that  the  number  of 

Lj  blocks  is  a  constant  equal  to  the  number  of  columns  of  D.  Thus,  the  assumption  in 
Theorem  6  about  the  perturbed  pencil  having  reducing  subspaces  of  the  same  size  is  implied 
by  the  assumption  that  the  perturbed  system  {C  +  Ei^,D  +  Eq)  has  a  controllability  subspace  of 
the  same  dimension  as  C(C,£)).  Also,  the  algorithms  one  used  for  this  problem  take 
advjmtage  of  the  special  form  of  B-[0\I]  by  operating  only  on  A  =  [D\C]  and  so  making  no 
perturbation  in  B;  thus  we  will  assume  F  =  0  [Van  E)ooren2].  Also,  as  we  wUl  see,  /?;  and  Lj 
are  quite  simply  related,  allowing  us  to  prove 

Corollary  4:  Suppose  that 

dimiCiC +  EcJ^  +  Eo))  =  dim(C(Ci))) 
and  that 

lUr    r  Ml  min  (Dif  J(T:,a:)  ,  Dif,((7.,a.)) 

\\(Ec'£d)\\e  =  ^  ■  -;:r—- ^TT^ ; ; T    '      ''herex<l    . 

{2-{p-  +  q-))--  +  2-max(p,q) 
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Then  either 
Case  1: 

with 

if  j:<1/2,  or 
Case  2: 


9^,(C(C^),C(C  +  £c^  +  ^d))  ^  arctan(2.c/;7) 


9^,(C(C4?),C(C  +  £c^+£^o))  ^  arctanC-^ ^  ,     ,,„J    • 

Vm     p  -r  (p — 1)-- 

Proof:  It  suffices  to  prove  ±at  |ii?:ll5  =  ||i:ll  r.  This  follows  from  (3.8b)  and  the  special 
form  of  B  and  the  fact  that  F  =  0.  In  fact,  (3.8b)  implies  that  [OlLj]  =  R2,  from  which  the 
equality  of  their  norms  follows  immediately.  Q.E.D. 

Algorithm  1  clearly  also  applies  to  algorithms  for  compute  the  controUability  subspace. 
It  is  also  easy  to  see  that  these  results  apply  immediately  to  observability  subspaces  as  well  by 
duality  [Wonham]. 

Another  feature  of  a  control  system  (C  ,D)  for  which  we  can  derive  perturbation  bounds 
using  this  approach  is  the  spectrum  of  the  regular  part,  also  called  the  input  decoupling  zeroes 
or  uncontrollable  modes  of  the  control  system. 

Theorem  7:  Suppose  that  Case  1  of  Theorem  6  holds.  Suppose  further  that  the  block 
A---\B-_2  is  regular.  (This  implies  A-\B  has  no  L[  blocks  in  its  KCF.)  Then  the  spectrum  of 
the  perturbed  pencil  {A->rE)-\{B  +  F)  includes  the  spectrum  of 

where 

il(^'::.^'::)ll5^^-'?-il(f.^)ll£  ■ 

Similarly,  if  we  instead  assume  A^-XB-^  is  regular,  then  the  spectrtun  of  the  perturbed 
pendl  (A  +  £)-X(5  +  f )  includes  the  spectrum  of 

(A,i+£',J-X(5.,+F',J 

where 

Proof:  From  (3.6)  we  see  that 

£'■1  =  £.;-!.£-.    , 

E,  •■'<  ^^  E.'^t     L-^£^'i    , 

^'n  =  Fi-L-/z^   ,and 

£'22  =  E22-L2F.2     . 

Substitute  in  the  bounds  on  ||£(y||  and  ||£,y||  found  there  along  with  ||Z,(||  :s  1  in  these 
equations.  Q.E.D. 

We  may  now  use  estimates  from  section  3  of  this  report,  [Stewart2],  [Stewart3]  or 
anywhere  else  to  bound  the  perturbations  Ln  the  spectrum  of  the  pendl.   In  the  case  of 


30 


uncontrollable  modes  of  a  contrc!  system  this  theorem  implies 

Corollary  5:  Assume  we  are  in  Case  1  of  Corollary  4.   Then  the  uncontrollable  modes  of  the 
perturbed  control  system  (C  +  E(;^D  +  £^)  are  the  eigenvalues  of  the  pencil 

(A22+£"::)-X5-2 

where 


\E 


In  [Van  Dooren2],  P  ind  Q  are  chosen  in  (4.1)  so  that  P~-BQ  =  [I\0],  implying  5;:=/. 
Thus  the  problem  of  findlrig  the  eigenvalues  of  the  perturbed  pencil  (A;;  +  £'22)~^:2  above 
reduces  to  perturbation  theory  for  the  standard  eigenproblem. 
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5.  Namerical  Examples.  -'    ^   '       -   -' 

Example  1:  Consider  the  regular  penal  of  the  introduction' for  ti  =  10"*: 

1  0    0  ■ 

A-kB  =000       -  X 

.0  0   10-^ 

First  we  consider  the  partitioning  of  the  spectrum  ct  of  this  pendl  into  <7i=={10^},  ctt^jI},  and 
(73= {IQ--}.  We  wish  to  know  for  what  values  of  €  and  TOL  this  decomposition  is  stable. 
From  the  definitions  of  Dif^  and  E>if,  in  section  3,  we  see  that 

Dif„(c7.,cr-CTj  =  .998  =  Dif;(cr.,a-crJ    , 
Dif„(cT;,a-cr:)  =  flO"'  =  Dif,(cr;,c7-{T,)    ,  and 

Dif^(CT3,cr-a3)  =  510-*  =  Dif,(cT3,cr-ar3)    . 

It  is  also  easy  to  see  that  p;  =  <7,  =  l  for  j  =  1,2,3.  Therefore,  from  (1.3)  and  (1.4)  of  Theorem 
A  we  see  that  if 

e  <   ^"^?       =  1.25-10-^ 
4 

and 

2.3.  Xl±l±10^  <  TOL 
l-(£-8-105)- 

then  the  decomposition  ct  =  ct_IJ  cr-lj  a3  will  be  stable.  For  example,  if  we  choose  TOL  =  100, 
then  e  has  to  satisfy 

e  <  1.M0-* 

for  stability. 

If  we  set  TOL  =  =0 ,  then  since 

Pif;,(cr.,cT-or/)  =  .99999 

Dif^((T,,cr-a;)  =  7.0710-* 

Dif^((T3,a-CT3)  =  7.07-10-* 

we  see  from  (1.5)  that  the  decomposition  cr  =  cr.|Ja-|Ja3  will  be  stable  if 

e  <  2.5-10-*   . 

From  Corollary  2,  we  see  that  these  upper  bounds  on  e  could  not  possibly  be  any  larger  than 
7.0710-*,  since  a  perturbation  of  that  size  would  make  the  eigenvalues  0  and  10"^  coalesce. 

If  t>2.5-10"*,  then  Theorem  A  no  longer  guarantees  the  the  eigenvalues  at  0  and  10"^ 
cannot  coalesce.  In  this  case,  we  consider  the  decomposition  cr  ={10^}  and  a;  =  {0,10"*}. 
Now  the  quantities  we  need  to  apply  Theorem  A  are 

Dif,(«T,,cr2)  =  Dif„((T:,aJ  =  Dif,(a;,CT;)  =  Dif,(a2,CTj  =  .998    , 

p,  =  qi=l,  and 

Dif,(<7.,c7;)  =  .99999   . 

From  (1.3)  and  (1.4)  we  see  a  =  a.lja-  is  stable  if 
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€  <  .2495 


and 

2-2.   (^■^^■^^■-f   <  TOL    . 
l-(4.008€)- 

For  example,  if  TOL  =  100,  then  the  decomposition  is  stable  if 

£  <  .2303 

If  we  set  rOL  =  '=,  then  by  (1.5)  cr  =  CT;|J ''':  i^  ^^t)'^  i^ 

t  <  .3535   . 

From  Corollary  2  we  see  that  these  upper  bounds  on  epsilon  can  be  no  larger  than  99999, 
since  a  perturbation  of  that  size  would  make  the  eigenvalues  10^  and  10"^  coalesce.  If 
t>.3535  Theorem  A  cannot  guarantee  that  a  can  be  decomposed  at  all. 

Exaimple  2:  We  consider  the  singiJar  pencil 

10-^   1-X 


A-kB  -  [D\C->J]  = 


0 
0 


0 

0 

2-X 

0 

0 

3-X 

This  pencil  is  in  the  canonical  form  (4.2)  with  m.  =  l,  m;  =  2,  n.  =  2  and  n2~2.  The  left 
reducing  subspace  P  which  is  spanned  by  [1,0, 0]'"  is  the  controOability  subspace  C(C,D)  of 
the  system  (C,£>). 

The  quantities  of  interest  in  Corollary  4  are 

Dif,(0,{2,3})  =  Dif.,(0,(2,3})  =  .112   , 
and  p  =  q  —  \.  Thus  Corollary  4  implies  that  if  E(-  and  Ep  are  such  that 

1  =  dim(C(C,£>))  =  dim(C(C  +  £c^  +  ^o)) 


and 

then  either 
Case  1: 

or 
Case  2: 


{Ec,Ep)\\£  =  .028- X  where  j:<1 


8^„(C(C,£))  ,  C(C  +  £c^  +  ^d))  ^  arctan(x) 


e„„(C(C,£))  ,  CiC  +  Ec,D  +  Er,))  ^  arctan(l  /  V3)  =  arctan(.577)  =  .52   . 

Thus,  for  example,  if  we  have  1|  f^,^^)!!  £•  <  .001,  then  any  one  dimensional  controllability 
subspace  of  {C  +  E(^J:>  +  Eq)  will  either  be  within  .036  radians  of  C(C JD)  or  at  least  .52 
radians  away  from  C(C,Z?).    A  simple  example  of  the  latter  situation  is 

0      1-X      0         0 
10-5      0      2-X      0 
0         0         0      3-X 

in  which  case  the  new  controllability  subspace  is  spanned  by  [0,1,0]^  and  so  is  orthogonal  to 
C(C,D). 

In  case   the   perturbed  controllability   subspace   falls   into   Case   1,   then  we  can   use 
Corollary  5  to  bound  the  uncontrollable  modes  of  the  pertiirbed  system:  the  perturbed  modes 


[D  +  Eo\C  +  Ec->J]  = 


33 


are  eigenvalues  of  the  matrix 


2   0 
0   3 


where  ||£"||£  s  ^l-\\{Ec,Er))\\r.  Gershgorin's  theorem  supplies  the  simple  bound  that  the 
peituibed  uncontrollable  modes  lie  in  disks  centered  at  2  and  3  with  radii 
^•Il(£c^o)li£^^-04. 
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